Measurement Study of Shared Content and User Request Structure in Peer-to-Peer Gnutella Network

نویسندگان

  • Przemyslaw Makosiej
  • German Sakaryan
  • Herwig Unger
چکیده

The following contribution presents the analysis of shared content distribution and user request structure in Gnutella peer-to-peer (P2P) network. Since the keyword search is the main approach to locate content in P2P systems, the shared content and user requests were studied from keywords perspective. More particularly, the keyword distributions in filenames, among peers and in user queries are studied and analytically represented. It was shown that the distribution of keywords among peers and in user queries do not follow Zipf’s distribution law, which is typical for many Internet distributions. The content of the peers was analyzed to discover similarity patterns between different peers. It has been demonstrated that all analyzed peers follow one of five main similarity patterns. In order to do so, the keyword-oriented similarity metric was proposed. In addition, the file distribution across a network was investigated and analyzed to discover replication law, willingness of peers to share files and the demographics of a network. This is intended to compare results with early presented works.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relating Query Popularity and File Replication in the Gnutella Peer-to-Peer Network

In this paper, we characterize the user behavior in a peer-to-peer (P2P) file sharing network. Our characterization is based on the results of an extensive passive measurement study of the messages exchanged in the Gnutella P2P file sharing system. Using the data recorded during this measurement study, we analyze which queries a user issues and which files a user shares. The investigation of us...

متن کامل

Estimating peer similarity using distance of shared files

Peer-to-Peer (p2p) networks are used by millions of users for sharing content. As these networks become ever more popular, it becomes increasingly difficult to find useful content in the abundance of shared files. Modern p2p networks and similar social services must adopt new methods to help users efficiently locate content, and to this end approximate meta-data search and recommendation system...

متن کامل

Mining Musical Content from Large-Scale Peer-to-Peer Networks

Peer-to-Peer (p2p) networks are an invaluable resource for various multimedia information retrieval (MIR) tasks, such as user and song similarity, recommendation and trend prediction, mainly due to their size and ability to capture user preferences. This paper presents a study performed on musical content collected from the a large scale p2p network. Using the song files shared by users in the ...

متن کامل

Zone Based Peer-to-Peer

Peer-to-Peer (P2P) networks without central entities, such as Gnutella or JXTA, generally suffer under a high signaling load resulting in poor efficiency. The main reason therefore is the necessity to flood requests in the overlay, since in most P2P protocols the nodes are not provided with any information about the P2P overlay network topology. This paper therefore addresses this application-l...

متن کامل

A measurement study supporting P2P file-sharing community models

1389-1286/$ see front matter 2008 Elsevier B.V doi:10.1016/j.comnet.2008.11.007 * Corresponding author. Tel.: +39 011 6706718. E-mail address: [email protected] (M. Sereno). Knowledge of emergent properties of existing peer-to-peer file-sharing communities can be helpful for the design and implementation of innovative peer-to-peer protocols/services that exploit autonomicity, self-configuratio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004